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The aim of this research was to evaluate the ability to switch attention and selectively attend to relevant 
information in children (10-15 years) with persistent listening difficulties in noisy environments. A wide 
battery of clinical tests indicated that children with complaints of listening difficulties had otherwise normal 
hearing sensitivity and auditory processing skills. Here we show that these children are markedly slower to 
switch their attention compared to their age-matched peers. The results suggest poor attention switching, 
lack of response inhibition and/or poor listening effort consistent with a predominantly top-down (central) 
information processing deficit. A deficit in the ability to switch attention across talkers would provide the 
basis for this otherwise hidden listening disability, especially in noisy environments involving multiple 
talkers such as classrooms. 

Listening and understanding a single talker in the presence of other talkers or distracters requires adequate 
hearing sensitivity, processing of the spectral (frequency and intensity) and temporal (time) cues, separating 
the information into coherent streams as well as selectively attending to the relevant talker and ignoring the 
distracters 1 . Selective auditory attention and re-orientation in a noisy environment is a basic yet complex 
behavior 2 4 . Most of our listening experiences in the environment are dynamic and the sources as well as 
information change constantly in time and space. Therefore the listener needs to orient attention when there 
is relevant information and rapidly re-orient attention from one stream of information to another as the situation 
demands 2 . 

There have also been suggestions that the deficits in auditory processing skills and speech perception in noise, 
which are most often observed in children who have a history of recurrent otitis media (middle ear infection) with 
effusion (OME) 5 , are associated with poor attention abilities 6 . We have focused our research to study the atten- 
tional mechanisms in such a population in order to gain further insight into the underlying cause of the listening 
difficulties in background noise. 

Auditory attention can involve both top-down and bottom-up processing based on the types and demands of 
a listening task 7 9 . A task which requires a voluntary selection of targets amongst distracters would involve top 
down (cognitive) control, whereas that requiring involuntary focus of attention due to factors such as salience will 
recruit bottom up (sensory) processing resources 10 . From a functional perspective, listening to speech in a noisy 
background requires a listener to remain alert and responsive to relevant cues (intrinsic and phasic alertness), 
orient attentional focus to important or salient signals (orienting) and selectively focus attention on the sounds of 
interest while ignoring the distracters (selective attention) 1112 . In addition, there may also be a need to simulta- 
neously focus attention on two or more signals (divided attention) and/or disengage and switch attentional focus 
between multiple sources of information based on relevance or salience (attention switching/re-orientation) 13,14 . 

Selective auditory attention in the time domain is especially important in situations where speech and noise 
sources overlap in space and where the listeners are required to constantly switch attentional focus in time 15 . A 
number of studies have demonstrated that tone detection in the presence of background masking noises 
improved significantly when the temporal interval of target occurrence was expected or cued 1619 . An extension 
to this notion of temporal selective attention is the time required for the subjects to re-orient their attention after 
attending to the expected time window. This has been studied in vision where the temporal re-orientation time 
varied between 200-500 ms 20 . However to our knowledge, this has yet to be examined in audition. 

In this study, we designed a task to examine the relative roles of top-down and bottom-up control of attention 
and the time taken to re-orient attention in the auditory domain. Based on a combination of the multi-probe 
signal method 21 and Posner's cueing paradigm 22 , this method involved priming a target signal at a specific time 
interval in a stimulus sequence (temporal epoch) by cueing, followed by frequent presentations of the target at the 
cued epoch. This ensured the focus of attention on the expected epoch. To identify attention specific effects, in 
addition to presenting targets at the expected interval, stimuli were also presented infrequently at unexpected 
epochs. Importantly we also allowed for the presentation of catch trials to facilitate bias correction and sensitivity 
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analysis. Here, we used a target identification task involving the 
discrimination and identification of a target syllable from a string 
of five syllables in the presence of a two-talker speech babble (see 
Figure 1). The duration of the five temporal windows (epochs) was 
based on the subject's individual reaction time via button press res- 
ponses in a control experiment. This ensured a correlation between 
attention and reaction, while also providing a means to quantify the 
subject's response accuracy over time and allowed us to model the 
patterns of attentional re-orientation. Additionally, by changing the 
temporal position of the priming cue and the expected epoch 
between the first and last stimulus windows, we were able to gain 
insights into the subject's auditory selective attention abilities. 

Performance benchmarks were collected from two control groups 
of subjects with normal hearing sensitivity and no reported listening 
difficulty (adults and children). We then applied this paradigm on a 
third (experimental) group of children who presented with persistent 
listening difficulties in noise. In particular, apart from parental and 
teacher concerns about their listening difficulties and a concomitant 
medical history of recurrent OME, this last group of children other- 
wise performed similar to the control (children) group when assessed 
using a wide range of clinical tests for hearing sensitivity and auditory 
processing. Apart from the standard test battery recommended by 
the American Academy of Audiology (2010) 23 , they were also exam- 
ined on additional tests (See Table 1) that ruled out deficits in peri- 
pheral hearing, auditory short-term memory, auditory sustained 
attention and auditory processing (See Table 2). 

Previous studies, although using shorter observation time win- 
dows than the current experiment, have demonstrated a reduction 
in sensitivity to targets outside a certain time window around an 
expected epoch 1618 . Detection of the targets occurring earlier than 
expected has been shown to involve involuntary shifts of attention 
requiring bottom-up processing resources; whereas the detection of 
targets later than expected involves voluntary disengagement and 
switch of attention from the expected temporal epoch requiring 



top-down processing resources 910 . Furthermore, we anticipate a 
gradual improvement in sensitivity over time at the unexpected 
epochs following the epoch when a target was expected but not 
presented 24 . We assessed the difference in sensitivity to identify a 
target for expected and unexpected targets presented at the first 
epoch as a measure of selective attention and the time taken to 
relatively recover the sensitivity for the unexpected targets as a mea- 
sure of attention switching. All the participants were tested on 2 
conditions, an "Early" condition in which the target syllable occurred 
frequently (60%) in Epoch 1 and a "Late" condition in which the 
target occurred frequently in last epoch (Epoch 5). That is, for the 
"Early" condition the target syllables occurred infrequently at the 
unexpected epochs (2-5) while the converse was true for the "Late" 
condition where target occurrence at epochs 1-4 was unexpected. 
The "Early" condition allowed us to examine the voluntary attention 
re-orientation mechanisms that are distinct from the involuntary 
attentional processes of the "Late" condition 25 . 

To our knowledge, this is the first investigation and demonstration 
of temporal attentional re-orientation in children. Most impor- 
tantly, these results indicated a significantly longer attentional re- 
orientation time for children who reported with persistent listening 
difficulties and a history of recurrent OME, in contrast to an age 
matched control group. 

Results 

Adults and children with no listening difficulty. Early condition. 
We examined subjects with normal hearing and no reported listening 
difficulties (12 adults and 12 children). We observed several distinct 
patterns in hit rate and false alarm responses. Overall, the hit rates 
at the expected epoch were considerably higher than those at 
the unexpected epochs in children but not for adults (Figure 2, 
blue and green bars). For the children, the hit rates dropped 
substantially immediately after Epoch 1 (from 0.82 ± 0.01 to 0.39 
± 0.07) in the "Early" condition, then gradually improved consistent 




2000 

Time (milliseconds) 



Figure 1 | A time domain view of the stimulus presented in a test trial within an experimental block ("Early" Condition) in which the target was 
presented frequently and cued at the first temporal epoch. (A) - Cue-Tone (2500 Hz); (B) - Target (da) validly cued and occurs in 60% of trials at this 
epoch; (C) - 2 Talker Babble (Female); (D) - Target (da) invalidly cued and occurs in 20% of trials at these epochs. 
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Table 1 Details of assessment measures, skills and tests undertaken in 


the current study to investigate peripheral hearing, auditory 


processing skills as well as auditory memory and attention 




Measures 


Skills 


Tests 


Peripheral Hearing 


Hearing sensitivity 
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Middle ear integrity 


Immittance audiometry 


In 

Spectral Processing 


Frequency Discrimination 
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Temporal Processing 


Temporal resolution 


Gap detection in noise test 56 




Temporal ordering 


nrcn parrern resr 




Temporal envelope processing 


Sinusoidal amplitude modulation (SAM) detection threshold 57 




Temporal fine structure processing 


Low frequency fine structure - Inter aural phase sensitivity (TFS-LF) 5 ^ 
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shifted harmonic discrimination (TFSl) 5 ^ 


Binaural Processinq 


Binaural integration 


Dichotic digits test 23 




Binaural separation 


Binaural masking level difference test 23 




Localization 


Speech localization in presence of 2 talker babble 59 


Auditory Stream Segregation 


Sequential stream segregation 


ABA_ paradigm (temporal coherence boundary) 60 




Spatial stream segregation 


Listening in Spatialized noise test (LiSN-S) 46 


Speech Perception in Noise 


Speech recognition in presence of Spatialized noise. High Cue SRT condition of LISN-S test 46 


Auditory Memory 


Short term and working memory 


Forward and Backward Digit span test 41 


Auditory Attention 


Sustained attention 


Auditory Continous Performance Test 42 


Selective attention and Attention switching 


Test Developed in the current study 



with a reorientation/re-preparation process 24 (Figure 2A, blue 
and green bars), reaching 0.69 ± 0.05 in Epoch 5. This did not 
occur for the adult subjects, where there was no notable drop 
in their hit rates after the expected epoch, maintaining a hit rate 
of 0.72 ± 0.06 at Epoch 2 (0.51 ± 0.01 seconds, see Methods). 
The considerable reduction in hit rate for the normal children 
after the expected epoch coupled with a relatively slow reorien- 
tation time meant that there remained a notable difference in hit 
rate between Epoch 1 and Epoch 5, the last temporal window. In 
order to compare the reorientation time between normal adults 
and children, we extrapolated the hit rates using a simple line 
of best fit (y = 0.089*x+0.22, adjusted R 2 = 0.81) and projected 
that normal children will only regain sensitivity at 3.32 ± 0.08 
seconds with a hit rate of 0.84 ± 0.18 (see Figure 2A). In the 
expected epoch (Epoch 1), there was no notable difference in hit 
rates for the normal adults and children, however, there was a 



considerably higher number of false alarms committed by the 
children (Figure 2B, blue and green): 0.17 ± 0.02 versus 0.06 ± 
0.01 respectively - suggesting a reduction in sensitivity (see below 
for d' analysis). 

Late condition. In the "Late" condition (Figure 3); the hit rates were 
substantially higher for the adults in all epochs with a consistently 
lower false alarm rate. While there was a notable difference in hit 
rates between adults and children in Epoch 5, both groups performed 
above the 75% threshold, with adults reaching 0.94 ± 0.00 and chil- 
dren 0.87 ± 0.01. In comparison between Early and Late conditions, 
the hit rates at the expected epoch showed a similar pattern. There 
was a higher hit rate for adults in the "Late" condition; however, 
a commensurate increase in false alarm rate was also observed, 
suggesting a similar level of sensitivity for target identification at 
the expected epoch for both groups of participants. That was not 



Table 2 | Mean scores with standard errors for auditory processing, memory and attention tests. There was no sign ificant difference between 


the 2 groups for any of the tests (p > 0.05). For the tests marked 


with an asterisk the individua 


scores for normal children were also within the 


previously published 42 ' 43 age based normative data 






Tests 


Normal Children 


Children with Listening Difficulty 


Brief tone frequency discrimination (threshold in Hz) 


100 Hz: 6.15 Hz (0.84) 


100 Hz: 6.54 Hz (1.05) 


1000 Hz: 5.43 Hz (1 .04) 


1000 Hz:6.70 Hz (1.37) 


Gap detection in noise (threshold in ms) 


2.92 ms (0.27) 


3 ms (0.30) 


Pitch pattern* (Percentage correct score) 


Right: 92.21% (2.14) 


Right:91.93% (2.85) 




Left: 92. 76% (1.91) 


Left: 92.2% (1.85) 


SAM detection (threshold in dB) 


4 Hz: -23.79 dB (0.55) 


4 Hz: -22.21 dB(0.78) 




128 Hz: -20.97 dB (0.71) 


128 Hz: -19.59 dB (1.34) 


TFS-LF 58 (Interaural phase difference threshold in degrees) 


44.05 deg (5.05) 


35.23 deg (3.04) 


TFS1 58 (score in Hz) 


24.5 Hz (4.43) 


28.39 Hz (3.41 ) 


Dichotic digits* (Percentage correct score) 


Right: 98.49% (0.89) 


Right:95.23% (1.31) 




Left: 94.81% (2.3) 


Left: 93.24% (1.70) 


Binaural masking level diffference* (difference in dB) 


12 dB (1 .2) 


12.08 dB (0.80) 


Localization (Root mean square lateral and polar angle errors (LAE 


LAE: 1 1.5 Az (0.9) 


PAE: 1 4.25 Az (0.72) 


and PAE) in Azimuth) 


LAE: 1 4.58 Az ( 1.43) 


PAE: 13.08 Az (0.57) 


Sequential stream segregation (temporal coherence boundary in Hz) 


60.92 Hz (9.90) 


80.90 Hz (9.69) 


Spatial stream segregation* (spatial advantage raw score) 
Speech Recognition in Noise* (Signal to masker ratio threshold in dB) 


1 1 .90 (0.70) 


1 1 .44 (0.56) 


SRT: -15 dB (0.6) 


SRT: -13.74 dB (0.78) 


Digit Span* (Forward and backward digit span raw score) 


Forward: 9.33 (0.54) 


Forward: 8.5 (0.59) 




Backward: 7.08 (0.49) 


Backward: 5.57 (0.25) 


Auditory Continous Performance* (raw scores) 


Inattention: 2.25 (0.65) 


Inattention: 4.08 (1.08) 


Impulsivity: 1.08 (0.66) 


Impulsivity: 1.5 (0.59) 




Vigilance: 0.66 (0.28) 


Vigilance: 1 (0.30) 
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Normal Adults |__| Normal Children ^^|Children with Listening Difficulty 



■ ■ ■ y=0.089"x+0.22 ■ - - ■ y=0.037*x+0.12 




1 "Expected" 



Epochs 




1 "Expected" 



Epochs 



Figure 2 | 2A and 2B: Pooled hit and false alarm rates for target identification for the 3 groups in the "Early" Condition across the 5 temporal epochs. 

The blue bars represent the data for the adult participants with no listening difficulty, the green bars for the children with no listening difficulty and 
the red bars for children with listening difficulty. The green and red dashed line in figure 2A are the lines of best fit used to extrapolate the attention 
re-orientation time for children without and with listening difficulties respectively. Error bars represent standard errors of the mean. 



true for children though, where a considerable increase in false alarm 
rate was observed in the Late condition (from 0.13 ± 0.02 to 0.46 ± 
0.03), suggesting a lower level of sensitivity. Similar to the finding in 
the "Early" condition, there was no notable difference between the 
false alarm rates for the unexpected epochs within each of the two 
groups (See Figures 2B and 3B, blue and green bars). 

Selective attention. The hit rate and false alarm results were summar- 
ized with a sensitivity (d') analysis (See Method and Supplement 1 
for further details). We compared the target identification sensitivity 
at the first temporal epoch when the target was expected at that epoch 
("Early" condition) to when it was unexpected ("Late" condition) as 
a measure of temporal selective attention ability 24 and found a not- 
ably higher sensitivity for target identification at the first epoch in the 
"early" condition (Expected) compared to that at the first epoch in 
the "Late" condition (Unexpected) suggesting a marked effect of 
focusing attention selectively on the expected epoch for both groups 
of participants. Further we also compared the sensitivity for iden- 
tifying the target at the expected epochs for both Early and Late 
conditions (Figure 4). In the Early condition, normal adults had a 
sensitivity of 2.49, 95% CI of [2.2 2.75] while normal children were 
considerably lower at 1.87, 95% CI of [1.62 2.11]. In the Late con- 
dition, normal adults had a sensitivity of 2.64, 95% CI of [2.35 2.91], 
with the normal children dropping in sensitivity to 1.23, 95% CI of 
[1.01 1.44]. In summary, sensitivity for the adult population did not 
vary between conditions and was consistently considerably higher 
than that of normal children. However, the converse was true for the 
normal children tested, where we observed a notable decrease in 
sensitivity between the Early and Late conditions. 

Children with listening difficulties. Early condition. This group of 
subjects consisted of 12 children, age-matched (p > 0.05) against the 



children in the control group, who presented with persistent listening 
difficulties especially in a noisy environment (see Methods). 
Consistent with the results from the children in the control group 
and from previous research 1618 , the hit rates for target identification 
was considerably higher when the target syllable occurred at the 
expected epoch and poorer elsewhere for all the participants (See 
Figures 2 A and 3 A, red bars). A comparison between the children 
in the experimental and control group showed no notable difference 
in hit rates at Epoch 1. Substantially more false alarms were 
committed by children in the experimental group (0.41 ± 0.03 
versus 0.17 ± 0.02 (normal children) and 0.06 ± 0.01 (normal 
adults)). 

Similar to the responses in the control group of children, there was 
a substantial drop in pooled hit rate immediately after the expected 
epoch - from 0.79 ± 0.01 (Epoch 1) to 0.16 ± 0.05 (Epoch 2). 
Additionally, hit rates were considerably lower for all the unexpected 
epochs when compared with the control group and the normal 
adults, only reaching 0.30 ± 0.05 at Epoch 5; However, a trend of 
recovery can still be seen, albeit at a much slower rate (Figure 2A). 
Again, we extrapolated the hit rates from Epoch 2 to 5 and estimated 
that the experimental group would have recovered their sensitivity at 
9.47 ± 0.25 seconds, reaching a hit rate of 0.78 ± 0.27; a notable 
increase in duration from the control group. 

Late condition. In the Late condition the hit rate at the expected 
epoch was notably lower for the children in the experimental 
group. Interestingly, the false alarm rates in the expected epoch 
for the experimental group did not vary significantly when com- 
pared with the "Early" condition, maintaining at 0.47 ± 0.01, even 
though a substantial increase was observed for children in the 
control group. 
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5 "Expected" 




5 "Expected" 



Figure 3 | 3A and 3B: Pooled hit and false alarm rates for target identification for the 3 groups in the "Late" Condition across the 5 temporal epochs. 

The blue bars represent the data for the adult participants with no listening difficulty, the green bars for the children with no listening difficulty and 
the red bars for children with listening difficulty. Error bars represent standard errors of the mean. 



3 




"Early" Condition "Late" Condition 

Test Condition 



Figure 4 | Pooled sensitivity (d') for target identification at the expected epochs for the 3 groups for "Early" and "Late" conditions. The blue bars 
represent the data for the adult participants with no listening difficulty, the green bars for the children with no listening difficulty and the red bars for 
children with listening difficulty. Error bars represent the 95% confidence intervals. * - Substantial Difference; NS - Non-substantial difference. 
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Selective attention. Similar to the results obtained in the previous 
experiment, we observed considerably higher sensitivity for target 
identification at the expected epoch compared to the unexpected 
epoch for both groups of children suggesting that similar to the 
children in the control group, the children in the experimental group 
showed an advantage for identifying the target based on its expect- 
ancy. Further comparison of target identification sensitivity between 
the 2 groups at the expected epochs for both "Early" and "Late" 
conditions showed a notably higher sensitivity for the children in 
the control group in the Early condition but there was no substantial 
difference in sensitivity between the two groups in the "Late con- 
dition (See Figure 4). 

Discussion 

In these experiments, we used a modified probe-signal method to 
analyze the rate of recovery in attention and extrapolated the time 
course necessary for subjects to regain attentional focus. In addition 
to segmenting each trial into equal response windows (epochs) based 
on each individual's minimal response times, false alarms were also 
recorded to ensure the validity of the task and to calculate the sens- 
itivity for target identification across the epochs. The catch trials used 
in the task also played an important role of reducing the attentional 
preparation of the listeners for targets presented at the unexpected 
epochs 24 . The role of the catch trials in context of this paradigm 
would be to generate a degree of uncertainty regarding the occur- 
rence of target which may relax the participants' state of preparation 
for responding to the target especially at the unexpected temporal 
epochs 24 . Although the proportion of catch trials used in the current 
study was relatively small (20%), it has been shown previously that 
such dis-preparation due to the presence of catch trials is an 'all or 
none' process, such that even a small percentage of catch trials is 
sufficient for the effect to be observed 24 . By combining a rigid time 
response window with hit rates and false alarm measures we were 
able to derive bias-free sensitivity measures at the expected epochs. 

We also found enhanced sensitivity for target identification at 
expected vs. unexpected epoch indicating an ability to attend selec- 
tively to the target at the expected time interval for all 3 groups of 
participants. From the present data we are unable to compare and 
contrast the magnitude or strength of selective attention abilities 
between these groups. 

While other studies have also shown a reduction in sensitivity after 
the expected epoch, here we also quantified the recovery rate. Such 
recovery can be attributed to the process of "re-preparation" by 
which the listener develops a new state of preparation across the 
unexpected epochs, due to the absence of targets at the expected 
epoch 24 . Despite the temporary disruptions in re-preparation caused 
by the presence of catch trials, the participants gradually recover their 
sensitivity. This recovery would require a goal driven mental effort 
for the listeners 26 involving predominantly top down voluntary con- 
trol of shift in attention across time 25 with possibly some involuntary 
bottom-up processing 10,24 . 

Interestingly, the results showed that our adult participants had an 
essentially flat distribution of hit rates across the five epochs in the 
"Early" condition. Given the similarities in hit rate, the results may 
suggest that adults reoriented faster than could be detected using a 
button press response paradigm. The duration of the epochs was 
tailored individually based on the response times derived from a set 
of control experiments (see Methods), which for adults was 360 ± 
15.56 ms. These results suggest a relatively rapid attention switching 
time in the time domain for adults. This is consistent with earlier 
studies pertaining to the recovery of the "attentional blink" phenom- 
enon in which the detectability of the second target in a dual target 
detection task was substantially reduced if it occurred within 
200-500 ms from the first and improved thereafter 27 . This has been 
observed in audition as well as vision 20-28 and indicates that the 
listeners' attention is captured by the first target and thus unable 



to rapidly switch to the next. While this may provide a tentative 
explanation to the results from the adult population, "attentional 
blink" in the auditory modality has not been examined in children. 

In contrast, the hit rates for normal children dropped significantly 
between Epoch 1 and Epoch 2 - before increasing slowly in Epoch 5. 
A linear extrapolation projected a reorientation time of 3.32 ± 0.08 
seconds to attain parity with the expected epoch, which is signifi- 
cantly slower than adult behavior. To our knowledge, this is the first 
report that quantified the differences in attentional reorientation 
between adults and children and suggests that this process continues 
to mature into adulthood. 

Most significantly, when we applied this testing methodology on a 
cohort of children who reported with persistent listening difficulties 
(Experimental group), their hit rates, false alarm rates and temporal 
re-orientation time were respectively lower and longer than that 
of the normal children and adults. A comparison of sensitivity in 
the expected epochs (Figure 4) clearly demonstrates this trend. 
Interestingly, there was a significant difference in d' between the 
Control and Experimental cohorts only in the "Early" condition, 
with the d' for the Control cohort reduced to the level of the 
Experimental subjects in the "Late" condition. The difference in 
hit rate responses between the cohorts could not account for such 
a drop in sensitivity. Rather, there was a highly significant difference 
in false alarm rates between the expected epochs for the children in 
the Control group, where a much smaller number of false alarms 
were committed in the "Early" condition. This suggests that the 
Experimental cohort were less able to inhibit their responses in the 
expected epoch of the "Early" condition, rather than representing a 
decrease in sensitivity of the Control group. 

The substantially higher false alarm rate in the Experimental 
group at the expected epoch in the "Early" condition may be due 
to a combination of excessive facilitation effect due to a reflexive shift 
of attention at the expected epoch along with poor response inhibi- 
tion 29 . Moreover, the total number of false alarms across the five 
epochs was significantly higher for these subjects, suggesting a 
general reduced ability to avoid responding in a catch trial. 
Previous work has shown that such intentional or voluntary inhib- 
itory control processes are vital in regulating the allocation of atten- 
tion 29,30 . A poor control of response inhibition as observed may also 
be related to poorer working memory capacity 31 . This inability to 
intentionally inhibit the allocation of attention to irrelevant stimuli 
may lead to increased distractibility of these children especially in 
noisy listening environments and may thus partly explain their 
difficulties in listening. 

The results seen with the experimental group are consistent with 
the idea that there are differences in the ability to rapidly shift atten- 
tion in the group of children with persistent listening difficulties 
compared to their age matched peers. This difference could involve 
a combination of top-down and bottom-up processing deficits. 
Listening in a noisy background or in a multi-talker environment, 
like a group discussion, not only requires efficient peripheral hearing, 
auditory processing and memory but also rapid switching of the 
focus of the listeners attention from one talker to another based on 
the changing relevance of information 32,33 . This requires the applica- 
tion of listening effort to attend to the expected as well as unexpected 
sources of information and the ability to inhibit responding to dis- 
tracting stimuli. Deficits in any of these abilities may affect an indi- 
vidual's ability to listen effectively in a noisy or multi- talker situation. 
Previous brain imaging studies indicate the involvement of predo- 
minantly frontal and parietal cortical areas of the brain in attention 
switching, listening effort and response inhibition control which 
further suggests a more central or top down processing deficit in 
the experimental group tested in this study 8,34 36 . 

Previous work involving children with listening difficulties has 
reported variable performance on psychoacoustic tasks meant to 
measure their auditory processing abilities. This has also been 
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attributed to poor auditory attention and is the consistent with the 
findings reported here 37,38 . Interestingly, all the participants in our 
experimental group reported a history of recurrent otitis media. 
While the data does not speak directly to any definitive links with 
OME, it may be that due to the transient disruptions in hearing 
associated with recurrent OME 39,40 the experimental group may have 
learnt to allocate most of their cognitive resources to selectively focus 
attention on expected information and the remaining resources are 
insufficient for them to switch their attention to any unexpected 
stimulus. Future studies are needed to evaluate this hypothesis. 
Further research should also explore whether these attention switch- 
ing 

deficits are specific to the auditory modality or are modality inde- 
pendent general cognitive deficits. 

This work has described an auditory attention switching deficit in 
a group of school-aged children with persistent listening difficulties 
in noisy environments. As the current set of standard clinical tests 
was unable to discriminate this group of listeners from normal con- 
trols, the test reported here may provide a good candidate test for 
children with listening difficulties. As attention switching requires 
predominantly top down control, the data is consistent with the 
suggestion that this deficit represents a more central pathology in 
contrast to a peripheral auditory processing deficit. An aspect of 
considerable interest will be the capacity of training or practice 
regimes to assist in overcoming this deficit. In a similar study, we 
are currently focusing on assessing children diagnosed with an 
auditory processing disorder to determine if it is part of the broader 
spectrum of listening difficulties and to gain insights into its under- 
lying cause. 

Method 

Participants. We examined three groups of participants - 12 normal adults (mean 
age 21.09, SD 3.52), 12 normal children {control group) (mean age 12.5, SD 1.55) and 
12 children with persistent listening difficulties (experimental group) (mean age 
11.38, SD 1.48). All subjects spoke Australian English as their first language, had 
normal hearing sensitivity and did not present with any middle ear pathologies. We 
ruled out any auditory memory, sustained attention or processing deficits for all the 
participants based on a comprehensive clinical test battery 23,41 (See Table 1 and 2). 
The test scores for each of the normal children on the standardized clinical tests were 
within the previously published norms 42 ' 43 . All the children in the experimental group 
presented with persistent listening difficulties that were based on parental, teacher 
and participant reports of concerns regarding their listening abilities especially in 
noisy environments. 

Children with a history or formal diagnosis of attention deficit/hyperactivity 
disorder (ADHD) were excluded from the study. Furthermore, all children were 
tested on the auditory continuous performance task 44 and there was no significant 
difference (See Table 2) between the control and experimental group children. Earlier 
studies have reported the continuous performance task as a screening test for 
ADHD 45 . Interestingly, their medical history also revealed a history of recurrent (>2 
episodes, Mean — 2.91, SD = 0.51) otitis media with effusion between the ages of 2-5 
years that was absent in the control group. Both the control and experimental groups 
were age matched (p > 0.05). Informed consent was obtained from all the partici- 
pants in accordance with procedures approved by the Human Research Ethics 
Committee at Macquarie University. For every child participant, care was taken to 
avoid participant fatigue and loss of motivation 45 by constant positive reinforcements 
and dividing the tests across multiple sessions within a span of 2 weeks. 

The modified Multi-probe signal method. The multi-probe signal method examines 
the allocation of attentional resources by examining a subject's target detection 
sensitivity in expected and unexpected time windows. It focused the subject's 
attention to the expected epoch by 1) presenting an auditory priming cue and 
2) repeated presentations of the target signal, at the primed epoch. Target signals were 
then presented at the unexpected temporal epochs. This allowed us to examine the 
subject's attention reorientation time by comparing the target detection sensitivity 
between the time windows. Here, we examined five temporal epochs with the 
following target presentation ratio: 60% in the expected epoch, 5% in the four 
unexpected epochs and 20% catch trials. All the participants were tested on two 
conditions, an "Early" condition in which Epoch 1 was the expected epoch, where the 
target syllable occurred frequently and a "Late" condition in which Epoch 5 was the 
expected. This allowed us to compare between voluntary endogenous attention 
re -orientation mechanisms ("Early") and involuntary exogenous process ("Late") 9,25 . 

The duration of the epochs was set based on the subject's response time (RT) 
derived from a series of training trials (see below), where 



Epoch duration — RT mean + RTstd 

This ensured a reasonably high level of test difficulty within the motor constraints 
of the participants. The mean inter-stimulus interval for each group of participants 
was: Adult: 360 ± 15.56 ms, Control group: 404.16 ± 13.84 ms, Experimental group: 
410.83 ± 14.3 ms. 

Stimuli and procedure. The experiments were performed in a darkened anechoic 
chamber. Stimuli consisted of five speech syllables from the list (/da/pa/ga/ka/ba) each 
150 ms long spoken in a male voice, presented from a loud speaker (Audience A3) 
located 1 m directly in front of the subjects (0 1 Azimuth). Maskers in the form of female 
"babble speech" were presented from two speakers (Tannoy V6) placed 1 m in front at 
±45°Azimuth at a constant intensity level of 70 dB SPL. The target to masker ratio 
varied between subjects (by varying the intensity of the syllable train) to keep a 75-85% 
target detection threshold. Speech syllable stimuli were used to emulate a natural 
listening environment within the constraints of linguistic load. It has been shown in an 
earlier study that attention can be specifically allocated to a single syllable 47 . The 
stimulus duration was kept short to preserve a narrow listening window 18 . 

Each trial began and ended with 800 ms of masker followed the syllable train that 
was mixed pseudo- randomly, with/da/being the target. A priming cue (100 ms, 
2.5 kHz tone) always preceded the expected epoch by 100 ms (see Figure 1). Previous 
research suggests that such cues may facilitate detection and help orient attention 
involuntarily even if the listener is unaware of its presence 22 . The cue frequency was 
chosen based on the premise that the key differences in the acoustics of the syllables 
used are in the 2nd and 3rd formant transitions and may enhance identification of 
target syllable at the cued epoch 48 . The duration between the cue and the onset of a 
syllable (stimulus onset asynchrony (SOA)) was maintained at 100 ms to avoid the 
phenomenon of inhibition of return at longer SO As which is known to impair the 
speed and accuracy of target identification at the cued epoch 49,50 . Auditory stimulus 
was generated using Matlab (version 2009b, The MathWorks Inc., Natick, 
Massachusetts) on a PC connected to an external sound card (RME FireFace 400). 
The subject's task was to press the response button as quickly as possible when the 
target syllable/ da/ was detected. Subject's also received instantaneous visual feedback 
(a red or green LED light) for correct target and false alarm identifications. A head 
tracker (Intersense IC3) constantly monitored the subject's position to ensure they 
directly faced the front speaker and a TDT System 2 (Tucker Davis Technologies) 
recorded the button press responses. Button presses that occurred 50 ms prior to the 
occurrence of target in a trial were rejected from the analysis with an assumption that 
they were random guesses. 

Each subject participated in two training and test blocks. Each training block con- 
sisted of 25 trials which had both cued and un-cued targets presented in a 
randomized order. They were initially presented at a target to masker ratio of —10 dB 
and subsequently varied in 1 dB steps to reach a hit rate threshold of 75-85% and <40% 
false alarm rate on catch trials 18 . Each test block examined the "Early" and "Late" 
conditions separately and was further divided into two split halves of 60 trials each of 
approximately 5 minutes in duration, short enough to avoid participant fatigue (See 
Appendix 2). The first 10 trials always had the syllable/da/presented at the expected 
epoch (priming trials) to focus the attention of the participant and were excluded from 
subsequent analysis. The remaining 50 trials were presented in a pseudo-random order 
that preserved position of the expected epoch (either "Early" or "Late"). 

Analysis. The results were analyzed using hit and false alarm rates of target 
identification at each epoch, as well as d' analysis at the first epoch. The hit rate was 
the proportion of correct responses, while the false alarm rate was the proportion of 
responses (button presses) in catch trials. The false alarms were assigned to the epochs 
based on the subject's response time 51 . Since the number of trials across the five 
epochs was uneven, the proportion of catch trials allocated to each epoch was based 
on the distribution probability of the targets; i.e., 60% of false alarms would be 
committed in the expected epoch. A control study corroborated this assumption by 
showing that uniform target presentation rates lead to a uniform false alarm 
distribution (see Supplement 2). Subsequent analysis was performed on the pooled hit 
rate and false alarm rates by combining the results across participants in each 
group 18 ' 52 ' 53 . Sensitivity (d') was calculated using the pooled hit and false alarm rates 
only at the first temporal epoch for both conditions in order to assess selective 
attention ability The 95% CI for the sensitivity measures were calculated using 
Miller's approach 54 (see Figure 4). Also, hit or false alarm rate values of 0 or 1 were 
adjusted by l/2n or l-l/2n respectively where n is the number of trials at each epoch 
to compensate for extreme values (i.e. 0 and 1) in the calculation 52 . 

In order to predict the temporal reorientation time, we modeled the hit rate from 
the unexpected epochs with a line of best fit (linear least square interpolation) and 
extrapolated to the epoch at which the hit rate reached 1 standard error of the 
expected epoch (see Figure 4). An adjusted chi-square test was used to calculate the 
goodness of fit. 
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